Search CORE

21 research outputs found

Fully automated sequence alignment methods are comparable to, and much faster than, traditional methods in large data sets: an example with hepatitis B virus

Author: Amanda C. Owings
Andrea K. Thomer
Andrew D. Sweet
Andrew H. Debevec
Aron D. Katz
Bret M. Boyd
Felipe N. Soto-Adames
Julie M. Allen
Nam-phuong D. Nguyen
Rhiannon M. Peery
Therese A. Catanach
Publication venue: 'PeerJ'
Publication date: 01/01/2019
Field of study

Aligning sequences for phylogenetic analysis (multiple sequence alignment; MSA) is an important, but increasingly computationally expensive step with the recent surge in DNA sequence data. Much of this sequence data is publicly available, but can be extremely fragmentary (i.e., a combination of full genomes and genomic fragments), which can compound the computational issues related to MSA. Traditionally, alignments are produced with automated algorithms and then checked and/or corrected “by eye” prior to phylogenetic inference. However, this manual curation is inefficient at the data scales required of modern phylogenetics and results in alignments that are not reproducible. Recently, methods have been developed for fully automating alignments of large data sets, but it is unclear if these methods produce alignments that result in compatible phylogenies when compared to more traditional alignment approaches that combined automated and manual methods. Here we use approximately 33,000 publicly available sequences from the hepatitis B virus (HBV), a globally distributed and rapidly evolving virus, to compare different alignment approaches. Using one data set comprised exclusively of whole genomes and a second that also included sequence fragments, we compared three MSA methods: (1) a purely automated approach using traditional software, (2) an automated approach including by eye manual editing, and (3) more recent fully automated approaches. To understand how these methods affect phylogenetic results, we compared resulting tree topologies based on these different alignment methods using multiple metrics. We further determined if the monophyly of existing HBV genotypes was supported in phylogenies estimated from each alignment type and under different statistical support thresholds. Traditional and fully automated alignments produced similar HBV phylogenies. Although there was variability between branch support thresholds, allowing lower support thresholds tended to result in more differences among trees. Therefore, differences between the trees could be best explained by phylogenetic uncertainty unrelated to the MSA method used. Nevertheless, automated alignment approaches did not require human intervention and were therefore considerably less time-intensive than traditional approaches. Because of this, we conclude that fully automated algorithms for MSA are fully compatible with older methods even in extremely difficult to align data sets. Additionally, we found that most HBV diagnostic genotypes did not correspond to evolutionarily-sound groups, regardless of alignment type and support threshold. This suggests there may be errors in genotype classification in the database or that HBV genotypes may need a revision

Directory of Open Access Journals

University of Nevada, Reno ScholarWorks Repository

Modeling and Rendering for Realistic Facial Animation

Author: A Lee
B Guenter
C Loop
C Rocchini
D DeCarlo
E Lafortune
F Pighin
H Hoppe
H Rushmeier
H Rushmeier
I Essa
J Cassell
K Waters
L Williams
P Debevec
P Shirley
R Stephen
RL Cook
S Andrew
S Marschner
V Blanz
Y Lee
Y Yizhou
Z Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Commentaries on viewpoint : physiology and fast marathons

Author: Andrade David C
Angeloudis Konstantinos
Balestrini Christopher S
Bielko Shane A
Blagrove Richard C
Bontemps Bastien
Bosch Andrew
Bottoms L
Boullosa Daniel
Brietzke Cayque
Böning Dieter
Chapman Robert F
Costa Campos Yuri de Almeida
Craighead Daniel H
Debevec Tadej
Del Coso Juan
Del Rio Rodrigo
Dewolf Arthur H
Dos Santos Tony Meireles
Escalera Albaro
Fernandes da Silva Sandro
Fernandes Ricardo J
Franco-Alvarenga Paulo Estevão
Gabler Mikaela C
González-Mohíno Fernando
González-Ravé José María
González-Rayas José Manuel
González-Yáñez José Manuel
Goss Curtis S
Gronwald Thomas
Guppy Fergus M
Hansen Rasmus K
Hayes Philip R
Holmberg Hans-Christer
Hoogkamer Wouter
Hottenrott Kuno
Hottenrott Laura
Hunter B
Ives Stephen J
Kipp Shalaya
Knechtle B
Kram Rodger
Layec Gwenael
Leist Margaret A
Lepers Romuald
Lige Mast T
Louis Julien
Macedo Vianna Jeferson
Malatesta Davide
Malysa William
Millet Gregoire P
Moreira Silva Bruno
Muniz-Pardos Borja
Muniz-Pumares D
Nikolaidis P T
Oliveira Pires Flávio
Oumsang Alicia S
Paris Hunter L
Pereira Guimarães Miller
Perrey Stephane
Pitsiladis Yannis
Proessl Felix
Ramirez-Campillo Rodrigo
Rayas-Gómez Ana Lilia
Ribeiro Lopes Thiago
Riveros-Rivera Alain
Santos-Concejero Jordan
Secher Niels H
Senefeld Jonathon W
Senefeld Jonathon W
Silva Marques de Azevedo Paulo Henrique
Sinai Erin C
Sperlich Billy
Stapley Paul
Sutehall Shaun
Ušaj Anton
Valenzuela Pedro L
Vilas-Boas João Paulo
Volianitis Stefanos
Weyand Peter G
Yates Brandon A
Zinner Christoph
Publication venue: 'American Physiological Society'
Publication date: 01/04/2020
Field of study

Q2Q1N/

University of Brighton Research Portal

VBN

Repositorio Institucional - Pontificia Universidad Javeriana

Genome_trees.zip

Author: Allen Julie M.
Boyd Bret M.
Catanach Therese A.
Debevec Andrew H.
Katz Aron D.
Nguyen Nam-phuong D.
Owings Amanda C.
Peery Rhiannon M.
Soto-Adames Felipe N.
Sweet Andrew D.
Thomer Andrea K.
Publication venue
Publication date: 30/01/2019
Field of study

Tree files estimated from sequence alignments of hepatitis B virus genomes. Trees are best maximum likelihood (ML) trees with bootstrap support values. Includes trees based on MUSCLE, manual, and PASTA genome alignments

Dryad Digital Repository (Duke University)

FigShare

GI_Clustering.zip

Author: Allen Julie M.
Boyd Bret M.
Catanach Therese A.
Debevec Andrew H.
Katz Aron D.
Nguyen Nam-phuong D.
Owings Amanda C.
Peery Rhiannon M.
Soto-Adames Felipe N.
Sweet Andrew D.
Thomer Andrea K.
Publication venue
Publication date: 30/01/2019
Field of study

Initial files of hepatitis B virus sequences clustered according to GenBank GI number

Dryad Digital Repository (Duke University)

FigShare

Genome_consensus_sequence.fasta

Author: Allen Julie M.
Boyd Bret M.
Catanach Therese A.
Debevec Andrew H.
Katz Aron D.
Nguyen Nam-phuong D.
Owings Amanda C.
Peery Rhiannon M.
Soto-Adames Felipe N.
Sweet Andrew D.
Thomer Andrea K.
Publication venue
Publication date: 30/01/2019
Field of study

Consensus sequence of hepatitis B virus genomes. This sequence was used as a reference for HBV manual alignments

Dryad Digital Repository (Duke University)

FigShare

Supplementary_Information.docx

Author: Allen Julie M.
Boyd Bret M.
Catanach Therese A.
Debevec Andrew H.
Katz Aron D.
Nguyen Nam-phuong D.
Owings Amanda C.
Peery Rhiannon M.
Soto-Adames Felipe N.
Sweet Andrew D.
Thomer Andrea K.
Publication venue
Publication date: 30/01/2019
Field of study

List of commands used in software programs for the alignment and tree estimation of hepatitis B virus sequences

Dryad Digital Repository (Duke University)

Cleaned_GenBank_Files.zip

Author: Allen Julie M.
Boyd Bret M.
Catanach Therese A.
Debevec Andrew H.
Katz Aron D.
Nguyen Nam-phuong D.
Owings Amanda C.
Peery Rhiannon M.
Soto-Adames Felipe N.
Sweet Andrew D.
Thomer Andrea K.
Publication venue
Publication date: 30/01/2019
Field of study

Hepatitis B virus GenBank files after initial data filtering steps

Dryad Digital Repository (Duke University)

FigShare

Genotype_trees.zip

Author: Allen Julie M.
Boyd Bret M.
Catanach Therese A.
Debevec Andrew H.
Katz Aron D.
Nguyen Nam-phuong D.
Owings Amanda C.
Peery Rhiannon M.
Soto-Adames Felipe N.
Sweet Andrew D.
Thomer Andrea K.
Publication venue
Publication date: 30/01/2019
Field of study

Tree files used for genotype occupancy tests in hepatitis B viruses. Trees estimated from manual or PASTA genome alignments. Files include .tre and .xml format

Dryad Digital Repository (Duke University)

FigShare